Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 68672 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 8.9 MiB |
| Average record size in memory | 136.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Text | 4 |
| Categorical | 4 |
END_SLK is highly overall correlated with END_TRUE_DIST and 1 other fields | High correlation |
END_TRUE_DIST is highly overall correlated with END_SLK and 1 other fields | High correlation |
GEOLOCSTLength is highly overall correlated with END_SLK and 1 other fields | High correlation |
LG_NO is highly overall correlated with RA_NAME | High correlation |
NETWORK_TYPE is highly overall correlated with SPEED_LIMIT and 2 other fields | High correlation |
RA_NAME is highly overall correlated with LG_NO | High correlation |
SPEED_LIMIT is highly overall correlated with NETWORK_TYPE | High correlation |
START_SLK is highly overall correlated with NETWORK_TYPE and 1 other fields | High correlation |
START_TRUE_DIST is highly overall correlated with NETWORK_TYPE and 1 other fields | High correlation |
CWY is highly imbalanced (85.2%) | Imbalance |
NETWORK_TYPE is highly imbalanced (70.5%) | Imbalance |
SPEED_LIMIT is highly imbalanced (64.1%) | Imbalance |
START_SLK is highly skewed (γ1 = 24.64322751) | Skewed |
END_SLK is highly skewed (γ1 = 24.06224455) | Skewed |
START_TRUE_DIST is highly skewed (γ1 = 24.73083934) | Skewed |
END_TRUE_DIST is highly skewed (γ1 = 24.14762603) | Skewed |
OBJECTID is uniformly distributed | Uniform |
OBJECTID has unique values | Unique |
START_SLK has 59959 (87.3%) zeros | Zeros |
START_TRUE_DIST has 60005 (87.4%) zeros | Zeros |
Reproduction
| Analysis started | 2023-12-12 15:07:15.757789 |
|---|---|
| Analysis finished | 2023-12-12 15:07:24.339059 |
| Duration | 8.58 seconds |
| Software version | ydata-profiling vv4.6.3 |
| Download configuration | config.json |
OBJECTID
Real number (ℝ)
UNIFORM  UNIQUE 
| Distinct | 68672 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84193446 |
| Minimum | 84159111 |
|---|---|
| Maximum | 84227782 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 536.6 KiB |
Quantile statistics
| Minimum | 84159111 |
|---|---|
| 5-th percentile | 84162545 |
| Q1 | 84176279 |
| median | 84193446 |
| Q3 | 84210614 |
| 95-th percentile | 84224348 |
| Maximum | 84227782 |
| Range | 68671 |
| Interquartile range (IQR) | 34335.5 |
Descriptive statistics
| Standard deviation | 19824.043 |
|---|---|
| Coefficient of variation (CV) | 0.00023545827 |
| Kurtosis | -1.2 |
| Mean | 84193446 |
| Median Absolute Deviation (MAD) | 17168 |
| Skewness | 0 |
| Sum | 5.7817324 × 1012 |
| Variance | 3.9299269 × 108 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 84159111 | 1 | < 0.1% |
| 84204898 | 1 | < 0.1% |
| 84204884 | 1 | < 0.1% |
| 84204885 | 1 | < 0.1% |
| 84204886 | 1 | < 0.1% |
| 84204887 | 1 | < 0.1% |
| 84204888 | 1 | < 0.1% |
| 84204889 | 1 | < 0.1% |
| 84204890 | 1 | < 0.1% |
| 84204891 | 1 | < 0.1% |
| Other values (68662) | 68662 |
| Value | Count | Frequency (%) |
| 84159111 | 1 | |
| 84159112 | 1 | |
| 84159113 | 1 | |
| 84159114 | 1 | |
| 84159115 | 1 | |
| 84159116 | 1 | |
| 84159117 | 1 | |
| 84159118 | 1 | |
| 84159119 | 1 | |
| 84159120 | 1 |
| Value | Count | Frequency (%) |
| 84227782 | 1 | |
| 84227781 | 1 | |
| 84227780 | 1 | |
| 84227779 | 1 | |
| 84227778 | 1 | |
| 84227777 | 1 | |
| 84227776 | 1 | |
| 84227775 | 1 | |
| 84227774 | 1 | |
| 84227773 | 1 |
ROAD
Text
| Distinct | 59783 |
|---|---|
| Distinct (%) | 87.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 536.6 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.8438228 |
| Min length | 4 |
Characters and Unicode
| Total characters | 469979 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 55604 ? |
|---|---|
| Unique (%) | 81.0% |
Sample
| 1st row | 2110042 |
|---|---|
| 2nd row | 2040155 |
| 3rd row | 2040149 |
| 4th row | 2040145 |
| 5th row | 2040144 |
| Value | Count | Frequency (%) |
| h009 | 200 | 0.3% |
| h006 | 177 | 0.3% |
| h005 | 152 | 0.2% |
| m031 | 104 | 0.2% |
| h001 | 99 | 0.1% |
| h043 | 84 | 0.1% |
| h002 | 70 | 0.1% |
| h017 | 70 | 0.1% |
| h007 | 67 | 0.1% |
| m037 | 61 | 0.1% |
| Other values (59773) | 67588 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 119875 | |
| 1 | 103138 | |
| 2 | 50670 | |
| 3 | 37713 | 8.0% |
| 4 | 33496 | 7.1% |
| 5 | 33086 | 7.0% |
| 6 | 24044 | 5.1% |
| 7 | 22233 | 4.7% |
| 8 | 21374 | 4.5% |
| 9 | 20775 | 4.4% |
| Other values (2) | 3575 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 466404 | |
| Uppercase Letter | 3575 | 0.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 119875 | |
| 1 | 103138 | |
| 2 | 50670 | |
| 3 | 37713 | 8.1% |
| 4 | 33496 | 7.2% |
| 5 | 33086 | 7.1% |
| 6 | 24044 | 5.2% |
| 7 | 22233 | 4.8% |
| 8 | 21374 | 4.6% |
| 9 | 20775 | 4.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 2587 | |
| M | 988 | 27.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 466404 | |
| Latin | 3575 | 0.8% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 119875 | |
| 1 | 103138 | |
| 2 | 50670 | |
| 3 | 37713 | 8.1% |
| 4 | 33496 | 7.2% |
| 5 | 33086 | 7.1% |
| 6 | 24044 | 5.2% |
| 7 | 22233 | 4.8% |
| 8 | 21374 | 4.6% |
| 9 | 20775 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| H | 2587 | |
| M | 988 | 27.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 469979 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 119875 | |
| 1 | 103138 | |
| 2 | 50670 | |
| 3 | 37713 | 8.0% |
| 4 | 33496 | 7.1% |
| 5 | 33086 | 7.0% |
| 6 | 24044 | 5.1% |
| 7 | 22233 | 4.7% |
| 8 | 21374 | 4.5% |
| 9 | 20775 | 4.4% |
| Other values (2) | 3575 | 0.8% |
ROAD_NAME
Text
| Distinct | 45773 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 536.6 KiB |
Length
| Max length | 69 |
|---|---|
| Median length | 66 |
| Mean length | 11.256407 |
| Min length | 4 |
Characters and Unicode
| Total characters | 773000 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 36897 ? |
|---|---|
| Unique (%) | 53.7% |
Sample
| 1st row | Hacket Rd |
|---|---|
| 2nd row | Knight St |
| 3rd row | Timperley Rd |
| 4th row | Mossop St |
| 5th row | Guthrie St |
| Value | Count | Frequency (%) |
| rd | 20173 | 13.2% |
| st | 13180 | 8.6% |
| wy | 4690 | 3.1% |
| pl | 4374 | 2.9% |
| ct | 3806 | 2.5% |
| dr | 2604 | 1.7% |
| l | 2526 | 1.7% |
| av | 2365 | 1.5% |
| hwy | 1954 | 1.3% |
| cl | 1756 | 1.2% |
| Other values (25126) | 95163 |
Most occurring characters
| Value | Count | Frequency (%) |
| 83942 | 10.9% | |
| e | 53734 | 7.0% |
| a | 50182 | 6.5% |
| r | 47445 | 6.1% |
| t | 44287 | 5.7% |
| n | 42499 | 5.5% |
| o | 41845 | 5.4% |
| l | 39297 | 5.1% |
| d | 38049 | 4.9% |
| i | 32188 | 4.2% |
| Other values (63) | 299532 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 531196 | |
| Uppercase Letter | 151059 | 19.5% |
| Space Separator | 83942 | 10.9% |
| Decimal Number | 1851 | 0.2% |
| Other Punctuation | 1500 | 0.2% |
| Dash Punctuation | 1183 | 0.2% |
| Open Punctuation | 1135 | 0.1% |
| Close Punctuation | 1133 | 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 53734 | |
| a | 50182 | |
| r | 47445 | 8.9% |
| t | 44287 | 8.3% |
| n | 42499 | 8.0% |
| o | 41845 | 7.9% |
| l | 39297 | 7.4% |
| d | 38049 | 7.2% |
| i | 32188 | 6.1% |
| s | 22580 | 4.3% |
| Other values (16) | 119090 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 25470 | |
| S | 20345 | |
| C | 15217 | |
| W | 10298 | 6.8% |
| P | 9450 | 6.3% |
| B | 8266 | 5.5% |
| L | 7426 | 4.9% |
| M | 7244 | 4.8% |
| H | 6324 | 4.2% |
| A | 6279 | 4.2% |
| Other values (16) | 34740 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 288 | |
| 4 | 287 | |
| 3 | 267 | |
| 2 | 179 | |
| 7 | 155 | |
| 5 | 153 | |
| 0 | 146 | |
| 8 | 144 | |
| 6 | 118 | |
| 9 | 114 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1307 | |
| ' | 148 | 9.9% |
| & | 22 | 1.5% |
| / | 13 | 0.9% |
| : | 7 | 0.5% |
| # | 3 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 83942 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1183 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1135 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1133 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 682255 | |
| Common | 90745 | 11.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 53734 | 7.9% |
| a | 50182 | 7.4% |
| r | 47445 | 7.0% |
| t | 44287 | 6.5% |
| n | 42499 | 6.2% |
| o | 41845 | 6.1% |
| l | 39297 | 5.8% |
| d | 38049 | 5.6% |
| i | 32188 | 4.7% |
| R | 25470 | 3.7% |
| Other values (42) | 267259 |
Common
| Value | Count | Frequency (%) |
| 83942 | ||
| . | 1307 | 1.4% |
| - | 1183 | 1.3% |
| ( | 1135 | 1.3% |
| ) | 1133 | 1.2% |
| 1 | 288 | 0.3% |
| 4 | 287 | 0.3% |
| 3 | 267 | 0.3% |
| 2 | 179 | 0.2% |
| 7 | 155 | 0.2% |
| Other values (11) | 869 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 773000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 83942 | 10.9% | |
| e | 53734 | 7.0% |
| a | 50182 | 6.5% |
| r | 47445 | 6.1% |
| t | 44287 | 5.7% |
| n | 42499 | 5.5% |
| o | 41845 | 5.4% |
| l | 39297 | 5.1% |
| d | 38049 | 4.9% |
| i | 32188 | 4.2% |
| Other values (63) | 299532 |
| Distinct | 45839 |
|---|---|
| Distinct (%) | 66.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 536.6 KiB |
Length
| Max length | 69 |
|---|---|
| Median length | 66 |
| Mean length | 11.230341 |
| Min length | 4 |
Characters and Unicode
| Total characters | 771210 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 36906 ? |
|---|---|
| Unique (%) | 53.7% |
Sample
| 1st row | Hacket Rd |
|---|---|
| 2nd row | Knight St |
| 3rd row | Timperley Rd |
| 4th row | Mossop St |
| 5th row | Guthrie St |
| Value | Count | Frequency (%) |
| rd | 20974 | 13.7% |
| st | 13395 | 8.8% |
| wy | 4686 | 3.1% |
| pl | 4377 | 2.9% |
| ct | 3806 | 2.5% |
| dr | 2650 | 1.7% |
| l | 2526 | 1.7% |
| av | 2395 | 1.6% |
| cl | 1756 | 1.1% |
| hwy | 1729 | 1.1% |
| Other values (25147) | 94564 |
Most occurring characters
| Value | Count | Frequency (%) |
| 84209 | 10.9% | |
| e | 53400 | 6.9% |
| a | 49865 | 6.5% |
| r | 46951 | 6.1% |
| t | 44463 | 5.8% |
| n | 42313 | 5.5% |
| o | 41229 | 5.3% |
| l | 38810 | 5.0% |
| d | 38732 | 5.0% |
| i | 32025 | 4.2% |
| Other values (63) | 299213 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 528736 | |
| Uppercase Letter | 151369 | 19.6% |
| Space Separator | 84209 | 10.9% |
| Decimal Number | 1888 | 0.2% |
| Other Punctuation | 1529 | 0.2% |
| Dash Punctuation | 1220 | 0.2% |
| Open Punctuation | 1130 | 0.1% |
| Close Punctuation | 1128 | 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 53400 | |
| a | 49865 | |
| r | 46951 | 8.9% |
| t | 44463 | 8.4% |
| n | 42313 | 8.0% |
| o | 41229 | 7.8% |
| l | 38810 | 7.3% |
| d | 38732 | 7.3% |
| i | 32025 | 6.1% |
| s | 22711 | 4.3% |
| Other values (16) | 118237 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 26319 | |
| S | 20633 | |
| C | 15100 | |
| W | 10295 | 6.8% |
| P | 9484 | 6.3% |
| B | 8231 | 5.4% |
| L | 7277 | 4.8% |
| M | 7007 | 4.6% |
| A | 6282 | 4.2% |
| H | 6128 | 4.0% |
| Other values (16) | 34613 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 296 | |
| 4 | 287 | |
| 3 | 277 | |
| 2 | 180 | |
| 0 | 159 | |
| 5 | 156 | |
| 7 | 153 | |
| 8 | 145 | |
| 6 | 121 | |
| 9 | 114 | 6.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1331 | |
| ' | 149 | 9.7% |
| & | 23 | 1.5% |
| / | 16 | 1.0% |
| : | 7 | 0.5% |
| # | 3 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 84209 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1220 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1130 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1128 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 680105 | |
| Common | 91105 | 11.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 53400 | 7.9% |
| a | 49865 | 7.3% |
| r | 46951 | 6.9% |
| t | 44463 | 6.5% |
| n | 42313 | 6.2% |
| o | 41229 | 6.1% |
| l | 38810 | 5.7% |
| d | 38732 | 5.7% |
| i | 32025 | 4.7% |
| R | 26319 | 3.9% |
| Other values (42) | 265998 |
Common
| Value | Count | Frequency (%) |
| 84209 | ||
| . | 1331 | 1.5% |
| - | 1220 | 1.3% |
| ( | 1130 | 1.2% |
| ) | 1128 | 1.2% |
| 1 | 296 | 0.3% |
| 4 | 287 | 0.3% |
| 3 | 277 | 0.3% |
| 2 | 180 | 0.2% |
| 0 | 159 | 0.2% |
| Other values (11) | 888 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 771210 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 84209 | 10.9% | |
| e | 53400 | 6.9% |
| a | 49865 | 6.5% |
| r | 46951 | 6.1% |
| t | 44463 | 5.8% |
| n | 42313 | 5.5% |
| o | 41229 | 5.3% |
| l | 38810 | 5.0% |
| d | 38732 | 5.0% |
| i | 32025 | 4.2% |
| Other values (63) | 299213 |
START_SLK
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 3571 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.6446895 |
| Minimum | 0 |
|---|---|
| Maximum | 3194.2 |
| Zeros | 59959 |
| Zeros (%) | 87.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 536.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 4.83 |
| Maximum | 3194.2 |
| Range | 3194.2 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 96.190029 |
|---|---|
| Coefficient of variation (CV) | 12.582595 |
| Kurtosis | 701.92989 |
| Mean | 7.6446895 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.643228 |
| Sum | 524976.12 |
| Variance | 9252.5217 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 59959 | |
| 0.09 | 110 | 0.2% |
| 0.1 | 96 | 0.1% |
| 0.08 | 84 | 0.1% |
| 0.03 | 80 | 0.1% |
| 0.02 | 75 | 0.1% |
| 0.06 | 70 | 0.1% |
| 0.07 | 65 | 0.1% |
| 0.05 | 64 | 0.1% |
| 0.04 | 61 | 0.1% |
| Other values (3561) | 8008 | 11.7% |
| Value | Count | Frequency (%) |
| 0 | 59959 | |
| 0.006 | 1 | < 0.1% |
| 0.01 | 35 | 0.1% |
| 0.011 | 1 | < 0.1% |
| 0.012 | 3 | < 0.1% |
| 0.013 | 1 | < 0.1% |
| 0.015 | 1 | < 0.1% |
| 0.02 | 75 | 0.1% |
| 0.022 | 1 | < 0.1% |
| 0.023 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3194.2 | 1 | |
| 3194.18 | 1 | |
| 3193.98 | 1 | |
| 3193.69 | 1 | |
| 3193.45 | 1 | |
| 3189.86 | 1 | |
| 3189.34 | 1 | |
| 3188.17 | 1 | |
| 3188.03 | 1 | |
| 3186.18 | 1 |
END_SLK
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 5875 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.8074911 |
| Minimum | 0.01 |
|---|---|
| Maximum | 3194.66 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 536.6 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.07 |
| Q1 | 0.16 |
| median | 0.366 |
| Q3 | 1.26 |
| 95-th percentile | 19.8335 |
| Maximum | 3194.66 |
| Range | 3194.65 |
| Interquartile range (IQR) | 1.1 |
Descriptive statistics
| Standard deviation | 97.663996 |
|---|---|
| Coefficient of variation (CV) | 9.9581019 |
| Kurtosis | 675.2453 |
| Mean | 9.8074911 |
| Median Absolute Deviation (MAD) | 0.266 |
| Skewness | 24.062245 |
| Sum | 673500.03 |
| Variance | 9538.2561 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.08 | 1764 | 2.6% |
| 0.1 | 1564 | 2.3% |
| 0.09 | 1468 | 2.1% |
| 0.12 | 1328 | 1.9% |
| 0.11 | 1293 | 1.9% |
| 0.16 | 1252 | 1.8% |
| 0.13 | 1170 | 1.7% |
| 0.15 | 1157 | 1.7% |
| 0.14 | 1137 | 1.7% |
| 0.07 | 1123 | 1.6% |
| Other values (5865) | 55416 |
| Value | Count | Frequency (%) |
| 0.01 | 46 | 0.1% |
| 0.011 | 1 | < 0.1% |
| 0.013 | 2 | < 0.1% |
| 0.015 | 3 | < 0.1% |
| 0.017 | 2 | < 0.1% |
| 0.018 | 2 | < 0.1% |
| 0.019 | 1 | < 0.1% |
| 0.02 | 153 | |
| 0.022 | 2 | < 0.1% |
| 0.023 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3194.66 | 1 | |
| 3194.2 | 1 | |
| 3194.18 | 1 | |
| 3193.98 | 1 | |
| 3193.69 | 1 | |
| 3193.45 | 1 | |
| 3189.86 | 1 | |
| 3189.04 | 1 | |
| 3188.28 | 1 | |
| 3188.03 | 1 |
CWY
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 536.6 KiB |
| Single | |
|---|---|
| Left | 1098 |
| Right | 1072 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.9524115 |
| Min length | 4 |
Characters and Unicode
| Total characters | 408764 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Single |
|---|---|
| 2nd row | Single |
| 3rd row | Single |
| 4th row | Single |
| 5th row | Single |
Common Values
| Value | Count | Frequency (%) |
| Single | 66502 | |
| Left | 1098 | 1.6% |
| Right | 1072 | 1.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| single | 66502 | |
| left | 1098 | 1.6% |
| right | 1072 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 67600 | |
| i | 67574 | |
| g | 67574 | |
| S | 66502 | |
| n | 66502 | |
| l | 66502 | |
| t | 2170 | 0.5% |
| L | 1098 | 0.3% |
| f | 1098 | 0.3% |
| R | 1072 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 340092 | |
| Uppercase Letter | 68672 | 16.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 67600 | |
| i | 67574 | |
| g | 67574 | |
| n | 66502 | |
| l | 66502 | |
| t | 2170 | 0.6% |
| f | 1098 | 0.3% |
| h | 1072 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 66502 | |
| L | 1098 | 1.6% |
| R | 1072 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 408764 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 67600 | |
| i | 67574 | |
| g | 67574 | |
| S | 66502 | |
| n | 66502 | |
| l | 66502 | |
| t | 2170 | 0.5% |
| L | 1098 | 0.3% |
| f | 1098 | 0.3% |
| R | 1072 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 408764 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 67600 | |
| i | 67574 | |
| g | 67574 | |
| S | 66502 | |
| n | 66502 | |
| l | 66502 | |
| t | 2170 | 0.5% |
| L | 1098 | 0.3% |
| f | 1098 | 0.3% |
| R | 1072 | 0.3% |
START_TRUE_DIST
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 3549 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.5768295 |
| Minimum | 0 |
|---|---|
| Maximum | 3194.55 |
| Zeros | 60005 |
| Zeros (%) | 87.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 536.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 4.61 |
| Maximum | 3194.55 |
| Range | 3194.55 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 96.104057 |
|---|---|
| Coefficient of variation (CV) | 12.683941 |
| Kurtosis | 705.91581 |
| Mean | 7.5768295 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.730839 |
| Sum | 520316.03 |
| Variance | 9235.9898 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 60005 | |
| 0.09 | 110 | 0.2% |
| 0.1 | 96 | 0.1% |
| 0.08 | 84 | 0.1% |
| 0.03 | 78 | 0.1% |
| 0.02 | 73 | 0.1% |
| 0.06 | 69 | 0.1% |
| 0.05 | 65 | 0.1% |
| 0.07 | 65 | 0.1% |
| 0.17 | 62 | 0.1% |
| Other values (3539) | 7965 | 11.6% |
| Value | Count | Frequency (%) |
| 0 | 60005 | |
| 0.01 | 32 | < 0.1% |
| 0.011 | 1 | < 0.1% |
| 0.012 | 2 | < 0.1% |
| 0.013 | 1 | < 0.1% |
| 0.015 | 1 | < 0.1% |
| 0.02 | 73 | 0.1% |
| 0.022 | 1 | < 0.1% |
| 0.023 | 1 | < 0.1% |
| 0.024 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3194.55 | 1 | |
| 3194.53 | 1 | |
| 3194.33 | 1 | |
| 3194.04 | 1 | |
| 3193.8 | 1 | |
| 3190.21 | 1 | |
| 3189.69 | 1 | |
| 3188.82 | 1 | |
| 3188.57 | 1 | |
| 3186.72 | 1 |
END_TRUE_DIST
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 5837 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.739631 |
| Minimum | 0.01 |
|---|---|
| Maximum | 3195.01 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 536.6 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.06955 |
| Q1 | 0.16 |
| median | 0.365 |
| Q3 | 1.25 |
| 95-th percentile | 19.609 |
| Maximum | 3195.01 |
| Range | 3195 |
| Interquartile range (IQR) | 1.09 |
Descriptive statistics
| Standard deviation | 97.577041 |
|---|---|
| Coefficient of variation (CV) | 10.018556 |
| Kurtosis | 679.10263 |
| Mean | 9.739631 |
| Median Absolute Deviation (MAD) | 0.265 |
| Skewness | 24.147626 |
| Sum | 668839.94 |
| Variance | 9521.2789 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.08 | 1765 | 2.6% |
| 0.1 | 1563 | 2.3% |
| 0.09 | 1468 | 2.1% |
| 0.12 | 1330 | 1.9% |
| 0.11 | 1293 | 1.9% |
| 0.16 | 1252 | 1.8% |
| 0.13 | 1169 | 1.7% |
| 0.15 | 1157 | 1.7% |
| 0.14 | 1137 | 1.7% |
| 0.07 | 1123 | 1.6% |
| Other values (5827) | 55415 |
| Value | Count | Frequency (%) |
| 0.01 | 46 | 0.1% |
| 0.011 | 1 | < 0.1% |
| 0.013 | 2 | < 0.1% |
| 0.015 | 3 | < 0.1% |
| 0.017 | 2 | < 0.1% |
| 0.018 | 2 | < 0.1% |
| 0.019 | 1 | < 0.1% |
| 0.02 | 153 | |
| 0.022 | 2 | < 0.1% |
| 0.023 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3195.01 | 1 | |
| 3194.55 | 1 | |
| 3194.53 | 1 | |
| 3194.33 | 1 | |
| 3194.04 | 1 | |
| 3193.8 | 1 | |
| 3190.21 | 1 | |
| 3189.69 | 1 | |
| 3188.82 | 1 | |
| 3188.57 | 1 |
NETWORK_TYPE
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 536.6 KiB |
| Local Road | |
|---|---|
| State Road | 3575 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 686720 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Local Road |
|---|---|
| 2nd row | Local Road |
| 3rd row | Local Road |
| 4th row | Local Road |
| 5th row | Local Road |
Common Values
| Value | Count | Frequency (%) |
| Local Road | 65097 | |
| State Road | 3575 | 5.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| road | 68672 | |
| local | 65097 | |
| state | 3575 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 137344 | |
| o | 133769 | |
| 68672 | ||
| R | 68672 | |
| d | 68672 | |
| L | 65097 | |
| c | 65097 | |
| l | 65097 | |
| t | 7150 | 1.0% |
| S | 3575 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 480704 | |
| Uppercase Letter | 137344 | 20.0% |
| Space Separator | 68672 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 137344 | |
| o | 133769 | |
| d | 68672 | |
| c | 65097 | |
| l | 65097 | |
| t | 7150 | 1.5% |
| e | 3575 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 68672 | |
| L | 65097 | |
| S | 3575 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 68672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 618048 | |
| Common | 68672 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 137344 | |
| o | 133769 | |
| R | 68672 | |
| d | 68672 | |
| L | 65097 | |
| c | 65097 | |
| l | 65097 | |
| t | 7150 | 1.2% |
| S | 3575 | 0.6% |
| e | 3575 | 0.6% |
Common
| Value | Count | Frequency (%) |
| 68672 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 686720 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 137344 | |
| o | 133769 | |
| 68672 | ||
| R | 68672 | |
| d | 68672 | |
| L | 65097 | |
| c | 65097 | |
| l | 65097 | |
| t | 7150 | 1.0% |
| S | 3575 | 0.5% |
RA_NO
Real number (ℝ)
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4395969 |
| Minimum | 1 |
|---|---|
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 536.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 7 |
| Q3 | 7 |
| 95-th percentile | 14 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 3.0114784 |
|---|---|
| Coefficient of variation (CV) | 0.46765014 |
| Kurtosis | 0.74995767 |
| Mean | 6.4395969 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.17102201 |
| Sum | 442220 |
| Variance | 9.0690022 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 36010 | |
| 2 | 10418 | 15.2% |
| 8 | 9347 | 13.6% |
| 1 | 4056 | 5.9% |
| 14 | 3914 | 5.7% |
| 5 | 2241 | 3.3% |
| 11 | 1673 | 2.4% |
| 6 | 1013 | 1.5% |
| Value | Count | Frequency (%) |
| 1 | 4056 | 5.9% |
| 2 | 10418 | 15.2% |
| 5 | 2241 | 3.3% |
| 6 | 1013 | 1.5% |
| 7 | 36010 | |
| 8 | 9347 | 13.6% |
| 11 | 1673 | 2.4% |
| 14 | 3914 | 5.7% |
| Value | Count | Frequency (%) |
| 14 | 3914 | 5.7% |
| 11 | 1673 | 2.4% |
| 8 | 9347 | 13.6% |
| 7 | 36010 | |
| 6 | 1013 | 1.5% |
| 5 | 2241 | 3.3% |
| 2 | 10418 | 15.2% |
| 1 | 4056 | 5.9% |
RA_NAME
Categorical
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 536.6 KiB |
| Metropolitan | |
|---|---|
| South West | |
| Wheatbelt | |
| Great Southern | |
| Mid West-Gascoyne | |
| Other values (3) |
Length
| Max length | 22 |
|---|---|
| Median length | 12 |
| Mean length | 11.851628 |
| Min length | 7 |
Characters and Unicode
| Total characters | 813875 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | South West |
|---|---|
| 2nd row | South West |
| 3rd row | South West |
| 4th row | South West |
| 5th row | South West |
Common Values
| Value | Count | Frequency (%) |
| Metropolitan | 36010 | |
| South West | 10418 | 15.2% |
| Wheatbelt | 9347 | 13.6% |
| Great Southern | 4056 | 5.9% |
| Mid West-Gascoyne | 3914 | 5.7% |
| Goldfields - Esperance | 2241 | 3.3% |
| Pilbara | 1673 | 2.4% |
| Kimberley | 1013 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| metropolitan | 36010 | |
| south | 10418 | 11.4% |
| west | 10418 | 11.4% |
| wheatbelt | 9347 | 10.2% |
| great | 4056 | 4.4% |
| southern | 4056 | 4.4% |
| mid | 3914 | 4.3% |
| west-gascoyne | 3914 | 4.3% |
| goldfields | 2241 | 2.4% |
| 2241 | 2.4% | |
| Other values (3) | 4927 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 123576 | |
| o | 92649 | |
| e | 89811 | |
| a | 58914 | 7.2% |
| l | 52525 | 6.5% |
| r | 49049 | 6.0% |
| n | 46221 | 5.7% |
| i | 44851 | 5.5% |
| M | 39924 | 4.9% |
| p | 38251 | 4.7% |
| Other values (17) | 178104 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 691635 | |
| Uppercase Letter | 93215 | 11.5% |
| Space Separator | 22870 | 2.8% |
| Dash Punctuation | 6155 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 123576 | |
| o | 92649 | |
| e | 89811 | |
| a | 58914 | |
| l | 52525 | |
| r | 49049 | 7.1% |
| n | 46221 | 6.7% |
| i | 44851 | 6.5% |
| p | 38251 | 5.5% |
| h | 23821 | 3.4% |
| Other values (8) | 71967 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 39924 | |
| W | 23679 | |
| S | 14474 | 15.5% |
| G | 10211 | 11.0% |
| E | 2241 | 2.4% |
| P | 1673 | 1.8% |
| K | 1013 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 22870 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6155 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 784850 | |
| Common | 29025 | 3.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 123576 | |
| o | 92649 | |
| e | 89811 | |
| a | 58914 | 7.5% |
| l | 52525 | 6.7% |
| r | 49049 | 6.2% |
| n | 46221 | 5.9% |
| i | 44851 | 5.7% |
| M | 39924 | 5.1% |
| p | 38251 | 4.9% |
| Other values (15) | 149079 |
Common
| Value | Count | Frequency (%) |
| 22870 | ||
| - | 6155 | 21.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 813875 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 123576 | |
| o | 92649 | |
| e | 89811 | |
| a | 58914 | 7.2% |
| l | 52525 | 6.5% |
| r | 49049 | 6.0% |
| n | 46221 | 5.7% |
| i | 44851 | 5.5% |
| M | 39924 | 4.9% |
| p | 38251 | 4.7% |
| Other values (17) | 178104 |
LG_NO
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 139 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 241.42469 |
| Minimum | 1 |
|---|---|
| Maximum | 814 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 536.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 102 |
| Q1 | 109 |
| median | 131 |
| Q3 | 315 |
| 95-th percentile | 608 |
| Maximum | 814 |
| Range | 813 |
| Interquartile range (IQR) | 206 |
Descriptive statistics
| Standard deviation | 187.76893 |
|---|---|
| Coefficient of variation (CV) | 0.77775365 |
| Kurtosis | 1.1317248 |
| Mean | 241.42469 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | 1.3950741 |
| Sum | 16579116 |
| Variance | 35257.171 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 110 | 4473 | 6.5% |
| 109 | 3207 | 4.7% |
| 131 | 3194 | 4.7% |
| 107 | 2746 | 4.0% |
| 125 | 2671 | 3.9% |
| 103 | 2523 | 3.7% |
| 104 | 2150 | 3.1% |
| 212 | 2126 | 3.1% |
| 101 | 1869 | 2.7% |
| 114 | 1534 | 2.2% |
| Other values (129) | 42179 |
| Value | Count | Frequency (%) |
| 1 | 380 | 0.6% |
| 2 | 89 | 0.1% |
| 3 | 209 | 0.3% |
| 4 | 336 | 0.5% |
| 101 | 1869 | |
| 102 | 1271 | |
| 103 | 2523 | |
| 104 | 2150 | |
| 105 | 1073 | |
| 106 | 1010 |
| Value | Count | Frequency (%) |
| 814 | 595 | |
| 813 | 535 | |
| 812 | 261 | |
| 811 | 279 | |
| 806 | 162 | 0.2% |
| 805 | 53 | 0.1% |
| 804 | 90 | 0.1% |
| 803 | 293 | |
| 707 | 77 | 0.1% |
| 706 | 65 | 0.1% |
LG_NAME
Text
| Distinct | 139 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 536.6 KiB |
Length
| Max length | 24 |
|---|---|
| Median length | 21 |
| Mean length | 11.46409 |
| Min length | 3 |
Characters and Unicode
| Total characters | 787262 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Harvey |
|---|---|
| 2nd row | Bunbury (C) |
| 3rd row | Bunbury (C) |
| 4th row | Bunbury (C) |
| 5th row | Bunbury (C) |
| Value | Count | Frequency (%) |
| c | 39394 | |
| wanneroo | 4473 | 3.6% |
| 3980 | 3.2% | |
| swan | 3207 | 2.6% |
| joondalup | 3194 | 2.6% |
| rockingham | 2746 | 2.2% |
| stirling | 2671 | 2.1% |
| cockburn | 2523 | 2.0% |
| gosnells | 2150 | 1.7% |
| mandurah | 2126 | 1.7% |
| Other values (155) | 58034 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 68927 | 8.8% |
| n | 68765 | 8.7% |
| 55826 | 7.1% | |
| r | 50452 | 6.4% |
| C | 47393 | 6.0% |
| e | 46243 | 5.9% |
| o | 44992 | 5.7% |
| ( | 41435 | 5.3% |
| ) | 41435 | 5.3% |
| l | 36324 | 4.6% |
| Other values (41) | 285470 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 524068 | |
| Uppercase Letter | 120518 | 15.3% |
| Space Separator | 55826 | 7.1% |
| Open Punctuation | 41435 | 5.3% |
| Close Punctuation | 41435 | 5.3% |
| Dash Punctuation | 3980 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 68927 | |
| n | 68765 | |
| r | 50452 | |
| e | 46243 | |
| o | 44992 | 8.6% |
| l | 36324 | 6.9% |
| i | 29927 | 5.7% |
| u | 24196 | 4.6% |
| t | 22063 | 4.2% |
| d | 18730 | 3.6% |
| Other values (14) | 113449 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 47393 | |
| M | 8522 | 7.1% |
| S | 7826 | 6.5% |
| B | 7780 | 6.5% |
| W | 6952 | 5.8% |
| G | 6073 | 5.0% |
| K | 5631 | 4.7% |
| A | 4373 | 3.6% |
| J | 4249 | 3.5% |
| R | 4065 | 3.4% |
| Other values (13) | 17654 | 14.6% |
Space Separator
| Value | Count | Frequency (%) |
| 55826 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 41435 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 41435 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3980 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 644586 | |
| Common | 142676 | 18.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 68927 | 10.7% |
| n | 68765 | 10.7% |
| r | 50452 | 7.8% |
| C | 47393 | 7.4% |
| e | 46243 | 7.2% |
| o | 44992 | 7.0% |
| l | 36324 | 5.6% |
| i | 29927 | 4.6% |
| u | 24196 | 3.8% |
| t | 22063 | 3.4% |
| Other values (37) | 205304 |
Common
| Value | Count | Frequency (%) |
| 55826 | ||
| ( | 41435 | |
| ) | 41435 | |
| - | 3980 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 787262 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 68927 | 8.8% |
| n | 68765 | 8.7% |
| 55826 | 7.1% | |
| r | 50452 | 6.4% |
| C | 47393 | 6.0% |
| e | 46243 | 5.9% |
| o | 44992 | 5.7% |
| ( | 41435 | 5.3% |
| ) | 41435 | 5.3% |
| l | 36324 | 4.6% |
| Other values (41) | 285470 |
SPEED_LIMIT
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 536.6 KiB |
| 50km/h applies in built up areas or 110km/h outside built up areas | |
|---|---|
| 50km/h | |
| 60km/h | 2397 |
| 70km/h | 1676 |
| 110km/h | 1368 |
| Other values (7) | 3116 |
Length
| Max length | 66 |
|---|---|
| Median length | 66 |
| Mean length | 53.283172 |
| Min length | 6 |
Characters and Unicode
| Total characters | 3659062 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 50km/h applies in built up areas or 110km/h outside built up areas |
|---|---|
| 2nd row | 50km/h applies in built up areas or 110km/h outside built up areas |
| 3rd row | 50km/h applies in built up areas or 110km/h outside built up areas |
| 4th row | 50km/h applies in built up areas or 110km/h outside built up areas |
| 5th row | 50km/h applies in built up areas or 110km/h outside built up areas |
Common Values
| Value | Count | Frequency (%) |
| 50km/h applies in built up areas or 110km/h outside built up areas | 54087 | |
| 50km/h | 6028 | 8.8% |
| 60km/h | 2397 | 3.5% |
| 70km/h | 1676 | 2.4% |
| 110km/h | 1368 | 2.0% |
| 80km/h | 1328 | 1.9% |
| 40km/h | 708 | 1.0% |
| 90km/h | 504 | 0.7% |
| 100km/h | 442 | 0.6% |
| 30km/h | 73 | 0.1% |
| Other values (2) | 61 | 0.1% |
Length
| Value | Count | Frequency (%) |
| built | 108174 | |
| up | 108174 | |
| areas | 108174 | |
| 50km/h | 60115 | |
| 110km/h | 55455 | |
| applies | 54087 | |
| in | 54087 | |
| or | 54087 | |
| outside | 54087 | |
| 60km/h | 2397 | 0.4% |
| Other values (8) | 4792 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 594957 | ||
| i | 270435 | 7.4% |
| u | 270435 | 7.4% |
| a | 270435 | 7.4% |
| s | 216348 | 5.9% |
| e | 216348 | 5.9% |
| p | 216348 | 5.9% |
| l | 162261 | 4.4% |
| t | 162261 | 4.4% |
| r | 162261 | 4.4% |
| Other values (18) | 1116973 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2639931 | |
| Space Separator | 594957 | 16.3% |
| Decimal Number | 301415 | 8.2% |
| Other Punctuation | 122759 | 3.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 270435 | |
| u | 270435 | |
| a | 270435 | |
| s | 216348 | 8.2% |
| e | 216348 | 8.2% |
| p | 216348 | 8.2% |
| l | 162261 | 6.1% |
| t | 162261 | 6.1% |
| r | 162261 | 6.1% |
| h | 122759 | 4.7% |
| Other values (6) | 570040 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 123201 | |
| 1 | 111404 | |
| 5 | 60115 | |
| 6 | 2397 | 0.8% |
| 7 | 1676 | 0.6% |
| 8 | 1328 | 0.4% |
| 4 | 708 | 0.2% |
| 9 | 504 | 0.2% |
| 3 | 73 | < 0.1% |
| 2 | 9 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 594957 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 122759 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2639931 | |
| Common | 1019131 | 27.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 270435 | |
| u | 270435 | |
| a | 270435 | |
| s | 216348 | 8.2% |
| e | 216348 | 8.2% |
| p | 216348 | 8.2% |
| l | 162261 | 6.1% |
| t | 162261 | 6.1% |
| r | 162261 | 6.1% |
| h | 122759 | 4.7% |
| Other values (6) | 570040 |
Common
| Value | Count | Frequency (%) |
| 594957 | ||
| 0 | 123201 | 12.1% |
| / | 122759 | 12.0% |
| 1 | 111404 | 10.9% |
| 5 | 60115 | 5.9% |
| 6 | 2397 | 0.2% |
| 7 | 1676 | 0.2% |
| 8 | 1328 | 0.1% |
| 4 | 708 | 0.1% |
| 9 | 504 | < 0.1% |
| Other values (2) | 82 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3659062 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 594957 | ||
| i | 270435 | 7.4% |
| u | 270435 | 7.4% |
| a | 270435 | 7.4% |
| s | 216348 | 5.9% |
| e | 216348 | 5.9% |
| p | 216348 | 5.9% |
| l | 162261 | 4.4% |
| t | 162261 | 4.4% |
| r | 162261 | 4.4% |
| Other values (18) | 1116973 |
ROUTE_NE_ID
Real number (ℝ)
| Distinct | 59783 |
|---|---|
| Distinct (%) | 87.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36249134 |
| Minimum | 141746 |
|---|---|
| Maximum | 6.0812068 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 536.6 KiB |
Quantile statistics
| Minimum | 141746 |
|---|---|
| 5-th percentile | 203327.55 |
| Q1 | 216424 |
| median | 232832.5 |
| Q3 | 247800 |
| 95-th percentile | 2.9783466 × 108 |
| Maximum | 6.0812068 × 108 |
| Range | 6.0797893 × 108 |
| Interquartile range (IQR) | 31376 |
Descriptive statistics
| Standard deviation | 1.0412624 × 108 |
|---|---|
| Coefficient of variation (CV) | 2.8725167 |
| Kurtosis | 11.52105 |
| Mean | 36249134 |
| Median Absolute Deviation (MAD) | 15327 |
| Skewness | 3.4744386 |
| Sum | 2.4893005 × 1012 |
| Variance | 1.0842274 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 247846 | 200 | 0.3% |
| 247653 | 177 | 0.3% |
| 247651 | 152 | 0.2% |
| 247804 | 104 | 0.2% |
| 247561 | 99 | 0.1% |
| 247595 | 84 | 0.1% |
| 247815 | 70 | 0.1% |
| 247857 | 70 | 0.1% |
| 247803 | 67 | 0.1% |
| 247616 | 61 | 0.1% |
| Other values (59773) | 67588 |
| Value | Count | Frequency (%) |
| 141746 | 1 | |
| 141751 | 1 | |
| 141753 | 1 | |
| 141798 | 1 | |
| 198864 | 1 | |
| 198866 | 1 | |
| 200001 | 1 | |
| 200004 | 1 | |
| 200005 | 1 | |
| 200006 | 1 |
| Value | Count | Frequency (%) |
| 608120680 | 1 | |
| 608120662 | 1 | |
| 608101468 | 1 | |
| 603044763 | 1 | |
| 602913587 | 2 | |
| 602913586 | 2 | |
| 602913585 | 2 | |
| 602913584 | 2 | |
| 602913581 | 1 | |
| 602913580 | 1 |
GEOLOCSTLength
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 68613 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.020990581 |
| Minimum | 9.2797277 × 10-6 |
|---|---|
| Maximum | 4.4271136 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 536.6 KiB |
Quantile statistics
| Minimum | 9.2797277 × 10-6 |
|---|---|
| 5-th percentile | 0.00059783043 |
| Q1 | 0.0014754407 |
| median | 0.0032134412 |
| Q3 | 0.0088890085 |
| 95-th percentile | 0.090785221 |
| Maximum | 4.4271136 |
| Range | 4.4271043 |
| Interquartile range (IQR) | 0.0074135678 |
Descriptive statistics
| Standard deviation | 0.091321291 |
|---|---|
| Coefficient of variation (CV) | 4.3505843 |
| Kurtosis | 531.41174 |
| Mean | 0.020990581 |
| Median Absolute Deviation (MAD) | 0.0022073317 |
| Skewness | 17.935057 |
| Sum | 1441.4652 |
| Variance | 0.0083395781 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.000359972346 | 3 | < 0.1% |
| 0.0003599709388 | 3 | < 0.1% |
| 2.095407854 × 10-5 | 3 | < 0.1% |
| 0.001609987266 | 2 | < 0.1% |
| 0.0008960017839 | 2 | < 0.1% |
| 0.001131034747 | 2 | < 0.1% |
| 0.0006340218043 | 2 | < 0.1% |
| 0.0009690122772 | 2 | < 0.1% |
| 0.0008079726137 | 2 | < 0.1% |
| 0.0005470030288 | 2 | < 0.1% |
| Other values (68603) | 68649 |
| Value | Count | Frequency (%) |
| 9.279727689 × 10-6 | 1 | < 0.1% |
| 9.418095048 × 10-6 | 1 | < 0.1% |
| 9.433196005 × 10-6 | 1 | < 0.1% |
| 9.495564153 × 10-6 | 1 | < 0.1% |
| 9.495564213 × 10-6 | 2 | |
| 9.899626207 × 10-6 | 1 | < 0.1% |
| 1.08367737 × 10-5 | 1 | < 0.1% |
| 1.303228564 × 10-5 | 1 | < 0.1% |
| 1.817195024 × 10-5 | 1 | < 0.1% |
| 2.095407854 × 10-5 | 3 |
| Value | Count | Frequency (%) |
| 4.427113629 | 1 | |
| 4.234289312 | 1 | |
| 4.177167087 | 1 | |
| 4.142640033 | 1 | |
| 3.385682454 | 1 | |
| 3.305478626 | 1 | |
| 3.033943389 | 1 | |
| 2.952415007 | 1 | |
| 2.928561639 | 1 | |
| 2.866949296 | 1 |
| CWY | END_SLK | END_TRUE_DIST | GEOLOCSTLength | LG_NO | NETWORK_TYPE | OBJECTID | RA_NAME | RA_NO | ROUTE_NE_ID | SPEED_LIMIT | START_SLK | START_TRUE_DIST | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CWY | 1.000 | -0.157 | -0.155 | -0.078 | 0.066 | 0.277 | -0.182 | 0.059 | 0.017 | -0.065 | 0.265 | -0.279 | -0.275 |
| END_SLK | -0.157 | 1.000 | 1.000 | 0.921 | 0.346 | 0.300 | 0.186 | 0.118 | 0.097 | -0.099 | 0.086 | 0.433 | 0.433 |
| END_TRUE_DIST | -0.155 | 1.000 | 1.000 | 0.921 | 0.347 | 0.297 | 0.186 | 0.119 | 0.097 | -0.099 | 0.086 | 0.432 | 0.432 |
| GEOLOCSTLength | -0.078 | 0.921 | 0.921 | 1.000 | 0.314 | 0.063 | 0.087 | 0.069 | 0.085 | -0.124 | 0.042 | 0.170 | 0.169 |
| LG_NO | 0.066 | 0.346 | 0.347 | 0.314 | 1.000 | 0.070 | 0.259 | 0.896 | 0.183 | -0.107 | 0.197 | 0.138 | 0.139 |
| NETWORK_TYPE | 0.277 | 0.300 | 0.297 | 0.063 | 0.070 | 1.000 | 0.345 | 0.067 | 0.018 | 0.231 | 0.641 | 0.515 | 0.510 |
| OBJECTID | -0.182 | 0.186 | 0.186 | 0.087 | 0.259 | 0.345 | 1.000 | 0.242 | 0.104 | -0.005 | 0.465 | 0.322 | 0.321 |
| RA_NAME | 0.059 | 0.118 | 0.119 | 0.069 | 0.896 | 0.067 | 0.242 | 1.000 | 0.390 | -0.000 | 0.232 | 0.043 | 0.042 |
| RA_NO | 0.017 | 0.097 | 0.097 | 0.085 | 0.183 | 0.018 | 0.104 | 0.390 | 1.000 | -0.004 | 0.260 | 0.033 | 0.032 |
| ROUTE_NE_ID | -0.065 | -0.099 | -0.099 | -0.124 | -0.107 | 0.231 | -0.005 | -0.000 | -0.004 | 1.000 | 0.055 | 0.063 | 0.062 |
| SPEED_LIMIT | 0.265 | 0.086 | 0.086 | 0.042 | 0.197 | 0.641 | 0.465 | 0.232 | 0.260 | 0.055 | 1.000 | 0.129 | 0.128 |
| START_SLK | -0.279 | 0.433 | 0.432 | 0.170 | 0.138 | 0.515 | 0.322 | 0.043 | 0.033 | 0.063 | 0.129 | 1.000 | 0.997 |
| START_TRUE_DIST | -0.275 | 0.433 | 0.432 | 0.169 | 0.139 | 0.510 | 0.321 | 0.042 | 0.032 | 0.062 | 0.128 | 0.997 | 1.000 |
| OBJECTID | ROAD | ROAD_NAME | COMMON_USAGE_NAME | START_SLK | END_SLK | CWY | START_TRUE_DIST | END_TRUE_DIST | NETWORK_TYPE | RA_NO | RA_NAME | LG_NO | LG_NAME | SPEED_LIMIT | ROUTE_NE_ID | GEOLOCSTLength | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 84159111 | 2110042 | Hacket Rd | Hacket Rd | 0.0 | 1.15 | Single | 0.0 | 1.15 | Local Road | 2 | South West | 211 | Harvey | 50km/h applies in built up areas or 110km/h outside built up areas | 218243 | 0.010896 |
| 1 | 84159112 | 2040155 | Knight St | Knight St | 0.0 | 0.86 | Single | 0.0 | 0.86 | Local Road | 2 | South West | 204 | Bunbury (C) | 50km/h applies in built up areas or 110km/h outside built up areas | 223594 | 0.009268 |
| 2 | 84159113 | 2040149 | Timperley Rd | Timperley Rd | 0.0 | 1.03 | Single | 0.0 | 1.03 | Local Road | 2 | South West | 204 | Bunbury (C) | 50km/h applies in built up areas or 110km/h outside built up areas | 242482 | 0.011123 |
| 3 | 84159114 | 2040145 | Mossop St | Mossop St | 0.0 | 0.39 | Single | 0.0 | 0.39 | Local Road | 2 | South West | 204 | Bunbury (C) | 50km/h applies in built up areas or 110km/h outside built up areas | 229682 | 0.003460 |
| 4 | 84159115 | 2040144 | Guthrie St | Guthrie St | 0.0 | 0.20 | Single | 0.0 | 0.20 | Local Road | 2 | South West | 204 | Bunbury (C) | 50km/h applies in built up areas or 110km/h outside built up areas | 218200 | 0.001862 |
| 5 | 84159116 | 2040136 | Floreat St | Floreat St | 0.0 | 0.38 | Single | 0.0 | 0.38 | Local Road | 2 | South West | 204 | Bunbury (C) | 50km/h applies in built up areas or 110km/h outside built up areas | 215255 | 0.003993 |
| 6 | 84159117 | 2040132 | Dunstan St | Dunstan St | 0.0 | 1.22 | Single | 0.0 | 1.22 | Local Road | 2 | South West | 204 | Bunbury (C) | 50km/h applies in built up areas or 110km/h outside built up areas | 212950 | 0.011666 |
| 7 | 84159118 | 2040129 | Glenroy St | Glenroy St | 0.0 | 0.24 | Single | 0.0 | 0.24 | Local Road | 2 | South West | 204 | Bunbury (C) | 50km/h applies in built up areas or 110km/h outside built up areas | 217005 | 0.002672 |
| 8 | 84159119 | 2040126 | West Rd | West Rd | 0.0 | 0.33 | Single | 0.0 | 0.33 | Local Road | 2 | South West | 204 | Bunbury (C) | 50km/h applies in built up areas or 110km/h outside built up areas | 245510 | 0.003445 |
| 9 | 84159120 | 2040114 | Jarvis St | Jarvis St | 0.0 | 0.69 | Single | 0.0 | 0.69 | Local Road | 2 | South West | 204 | Bunbury (C) | 50km/h applies in built up areas or 110km/h outside built up areas | 221750 | 0.006389 |
| OBJECTID | ROAD | ROAD_NAME | COMMON_USAGE_NAME | START_SLK | END_SLK | CWY | START_TRUE_DIST | END_TRUE_DIST | NETWORK_TYPE | RA_NO | RA_NAME | LG_NO | LG_NAME | SPEED_LIMIT | ROUTE_NE_ID | GEOLOCSTLength | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 68662 | 84227773 | 4240022 | Dulbelling South Rd | Dulbelling South Rd | 1.98 | 2.02 | Single | 1.98 | 2.02 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 212793 | 0.000373 |
| 68663 | 84227774 | 4240022 | Dulbelling South Rd | Dulbelling South Rd | 5.75 | 5.78 | Single | 5.75 | 5.78 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 212793 | 0.000273 |
| 68664 | 84227775 | 4240005 | Cubbine Rd | Cubbine Rd | 0.00 | 20.88 | Single | 0.00 | 20.88 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 210670 | 0.217490 |
| 68665 | 84227776 | 4240005 | Cubbine Rd | Cubbine Rd | 35.90 | 40.16 | Single | 35.90 | 40.16 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 210670 | 0.043219 |
| 68666 | 84227777 | 4240105 | Andrews Rd | Andrews Rd | 5.67 | 5.71 | Single | 5.67 | 5.71 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 201121 | 0.000424 |
| 68667 | 84227778 | 4240037 | Bland Rd | Bland Rd | 4.08 | 4.14 | Single | 4.08 | 4.14 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 204319 | 0.000562 |
| 68668 | 84227779 | 4240037 | Bland Rd | Bland Rd | 4.79 | 4.82 | Single | 4.79 | 4.82 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 204319 | 0.000282 |
| 68669 | 84227780 | 4240023 | Dangin South Rd | Dangin South Rd | 14.28 | 14.30 | Single | 14.28 | 14.30 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 211191 | 0.000188 |
| 68670 | 84227781 | 4240015 | Hayes Rd | Hayes Rd | 0.00 | 14.24 | Single | 0.00 | 14.24 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 219241 | 0.132928 |
| 68671 | 84227782 | 4240026 | Carter - Doodenanning Rd | Carter - Doodenanning Rd | 4.22 | 8.11 | Single | 4.22 | 8.11 | Local Road | 8 | Wheatbelt | 424 | Quairading | 110km/h | 207468 | 0.036484 |